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The algorithm described in [1] for estimating' the optimum combining 
weights for the Ka-band (33.7-GHz) array feed compensation system is compared 
with the maximum-likelihood estimate. This provides some improvement in per- 
formance, with an increase in computational complexity. However, the maximum- 
likelihood algorithm is simple enough to allow implementation on a PC-based com- 
bining system. 


I. Introduction 

We consider the problem of estimating combining 
weights for a signal received by an antenna array. The 
signal is modeled as a Gaussian random variable, and in- 
dependent Gaussian noise is added in each channel. An es- 
timation method that has been proposed is treated in [1]. 
Here we compare that method with the method that uses 
maximum-likelihood (ML) estimates of the pertinent pa- 
rameters. The computations required for these estimates, 
while more complex than the computations of [1], are well 
within the capabilities of a small on-site computer. 


where the h k (i) and a(i ) are independent complex Gaus- 
sian random variables, h k (i) is N(0,2<T k ), and a(i) is 
N( 0, 1). Then 1 

c jk = E(fj(i)r k (i)) = 2cr k Sjk + SjS k (2) 

If C is the complex I< x K matrix with entries Cj k , 
then the real and imaginary parts of the f k (i) (for k = 
1 ,,K, i fixed) have a 2A'-dimensional distribution with 
density 2 


II. The Maximum-Likelihood Equations 

The received signal in the fcth channel at time i is as- 
sumed to be 


Pi = 


ir K det(C) 


■=— exp 


K 


j,*=l 


( 3 ) 


1 The overbar denotes a complex conjugate. 

f k (i) = S k a(i) + h k (i), k = 1 , 2, . . . , K (1) 2 The arguments f, (t), .... f K (i) of p, are not shown explicitly. 
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It is convenient to introduce the quantities 


T k 


Sk 

k 


t l 

Ojk = J^2rj(i)r k (t) 


i= 1 


(4) 


It follows from Eq. (9) that 


(ii) 


Then 


cjk = 2 <Tj<r k ($jk 4- Tjfk') 


(5) 


and the elements of the inverse matrix are 


K 

A = -]T ,n (2iral)-\n (y) 

k-1 


1 K 1 
--V — 


&kj 



( 12 ) 


bjk = (C~ l ) jk - ^- k (bjk - U) T k ) 


where 


T = 


K 


i + Ei^i 2 

k=i 


Also, we have 


det (C) = 2 K 



Using these values in Eq. (3), 


( 6 ) 

III. Cascaded Maximum-Likelihood 
Estimates 

For maximum-likelihood estimates 5* and &k, or equiv- 
alently Tk and < 7 *, we need to solve the equations obtained 

(7) by setting the derivatives of A equal to zero. This sys- 
tem of equations must be solved iteratively. It need not 
have a unique solution, for the parameters are not even 
determined by the statistics of the signals [Eq. (1)] un- 
less at least three of the Sk are nonzero. For this rea- 
son, this approach is not pursued here. We assume that 
the < 7 * are estimated from separate observations with the 

(8) antenna pointed “off source.” These noise estimates are 
themselves maximum-likelihood estimates obtained from 
the noise samples by differentiating Eq. (12) with respect 
to <jj (assuming Tk =0): 


Pi ~ 


(2ir)*(n f=X)' 


x exp 








(Tj(Tk 



where ajj is given by Eq. (11). The maximum-likelihood 
estimates of the T* use these noise estimates. 

Differentiating Eq. (12) with respect to Tj, we get 


(9) 


We define the likelihood function as 


K 


1 


-1-1 y 

y 2y 2 0 -jfcCr, 

i ' k,m—\ 


-a krn f m f k Tj 


A = jT> lnpi 

i = 1 

In terms of the sample covariances 


( 10 ) 



djbjffc = 0 

<Tj(Tk 


It can be shown that this is equivalent to the simpler con- 
dition 
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Gj — Gj + 8a j 


(17) 


K 


E 


Qjbj 


T k = 2 7 T} 


Replacing the parameters by their estimates, we get 


Then the estimates Tj are close to the true values, 


K 


E Pi-f k = 2jTj 

t a *°i 


(13) 


This equation states that the complex K vector with com- 
ponents T k is an eigenvector of the matrix composed of 
the elements a k j /(a k aj), with eigenvalue 27 . If we replace 
the matrix elements by their mean values, using the true 
values of the a k , then this matrix has K — 1 eigenvalues 
equal to 2 , and one larger eigenvalue 27 , corresponding 

to the eigenvector T k . Hence, the estimates T k can be 
found in terms of the a k by numerically finding the largest 
eigenvalue of the matrix in Eq. (13) and its eigenvector. 
The eigenvector must be scaled so that the eigenvalue 27 
satisfies the relation [from Eq. (7)] 


Tj^Tj+STj 


(18) 


Expand Eq. (15), keeping only those terms which are 
of first order in the deviations 8a j k , 8a y, and 8Tj. We get 


E 


^-f t + 

Ja k aj 


Ckj 

2 <T k <Tj 


m- 


2a k aj \ aj a k ) 


K 

j6T 3 + f ; £ (f k ST k + T k 6T k ) 

k = 1 

Using the formula in Eq. (5) and simplifying, the result is 


K 


7 = i + ^inp 


(14) 


k= 1 


K 


(7-1 )Sf j +f j J2f k ST k = 


k = l 


The method for solving Eq. (13) is described briefly in 
Appendix A. 


K 


8a 


kj 


K 


tl 2(Tk(T l 


T k 


T, 

k-l k 


IV. Variance of the Cascaded ML Estimates 
for the Tj 

For a large L ) the sample covariances aj k are close to 
their mean values: 

Qjk — Cjk T 8a j k (15) 

where the difference 8aj k has a mean of zero and a small 
variance. It is easily shown that 


E{8aj k 8ai m } — 

X (<5 im + nf m ) [s tj + T,f. )) (16) 
If the estimates frj are close to the correct values, 


(19) 

If we multiply this equation by 8Tj and sum over j } we 
get 

K __ 

(7-1 )^2(fkSfk + T k Sf k ) = 

k= 1 X 



This equation determines the real part of Y^T k 8T k . The 
imaginary part of this sum is undetermined, since the T k 
can be multiplied by an arbitrary common complex factor 
of absolute value 1 . From Eq. (20), we can set 
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k = l 


Square Eq. (22) and take the expected value. We de- 
note the deviation of Eq. (22) by the prefix <5 0 distin- 
guish it from that obtained by other methods below. Using 
Eqs. (16) and (23), the result is 


£(M)I 2 ) 


(t- i) 




7-1 — |T ; f +i 7 |7)| 2 


where w is a real quantity not yet specified. Using this 
expression in Eq. (19), we get 


+ 


AM 




K 




4 2 ^tl|7)| 4 


' 7-1 






Tic 


^ 2^0-,- 2( 7 - 1) 


+ (7 + T) 2 \Tj\ 2 


(24) 


K 


x E 


yfV rfl 

m-L k 


m } k~[ 


2ak(Trr 


K 


+ ^vE— fii 2 
7 “ 1 ** 


-( 7 + l)^f i ->f i 


( 22 ) 


In the following, we will take w — 0, since this turns 
out to give the best results. The estimate <r ; , found with 
the signal absent, is 


Expanding as above, 


& (T j Sdjj 

<Tj ~ 4 a] 


The sum over j of this expression leads to 


K 


Y,E(\6oT j \ 2 ) = 

j = 1 


1 f 1 r rr , 1 1 1 

-T\l 7 [ A - 1+ 4 7 J + 4 M 


K 


X (7+ l ) 2 - 


2 2 7 + 1 |^. |4 

(7 - 1) 2 


£|r*r 


k = 1 


(25) 


Now we consider another estimate, where the first chan- 
nel is taken to be the one with maximum signal strength, 
and §1 (or T\) is estimated first. By Eq. (5), 

£(a„) = 2«7 2 (l + |fi| 2 ) 

This leads to the estimate 


i^Dr 1 


(26) 


Using Eq. (16) (with no signal), we get 


The resulting error £ 2^1 has variance 


(23) 


£ (i«i i )=(i + ¥) ii iilr ! < 27 -> 


where M is the number of samples used in the noise esti- This estimate can be used to obtain an estimate for Tj for 
mates. j > 2 by using 


101 



O' > 2) 


Tj = 


a i j 


2<xi crjT x 


(28) E( \6iwtf) = + |f, | 2 ) 2 + 4(1 - 00 


The variance of this estimate is 
E{\S 2 fj\ 2 ) = 


1 

4£|Ti| 4 . 


If; 


| 2 (l + |f 1 | 2 ) 2 + 4|f 1 | 2 (l + |f 1 | 2 ) 


+ 


|fj| 2 

4M|7)| 4 


(l + |fi| 4 ) 


(29) 


x (1 + 4<5ji|fi| 2 + 4|fi| 4 ) (32a) 


A modified form of this last method was considered in 
[1]. There, T\ was estimated from Eq. (26) based on N 
samples, and the other Tj were estimated by Eq. (28) for 
a later set of L samples. For this method, it can be shown 
that 


V. Variance of the Combining-Weight 
Estimates 

We now consider estimates for the weights 



obtained from the cascaded ML estimates of the signal-to- 
noise ratio (SNR) and noise parameters. The estimate 



(30) 


has a deviation Sibj from the true value, given to the first 
order by 


bwj 


\/2 <Tj 




The mean-square value of this deviation can be computed 
for each of the estimation methods under consideration. 
We find 


£(|Mbf) = 


2(t-1) 2 <t 2 1 L 




AM 


K 


£ i f *i 4 - ^rrfil 4 + 


(31) 


EQSwjl 2 ) 


|f j | 2 (l + |f 1 | 2 ) 2 

8Naj\Ti | 4 




(l+lf.pKl + lf;! 2 ) If; I 2 

2Laj\fi | 2 8M<r?|f,| 4 

x (l + 4^ 1 |f' 1 | 2 + 4|f 1 | 4 ) (32b) 


These values, divided by |ujj| 2 , are plotted in Fig. 1. 
The values \T\\ 2 — 0.05 and \Tj | 2 = 0.005, j > 2, were 
used. M was fixed at the value 100,000, and N = L. There 
is no observable difference between the curves based on 
Eqs. (32a) and(32b). For a small L, the mean-square error 
from the maximum-likelihood formula is lower by 2.2 dB. 
For a large L, the other methods are better, but this is in 
an impractical range of the parameters. M should be at 
least as large as L, since the M samples provide noise esti- 
mates on which the subsequent estimates are based. The 
failure of the maximum-likelihood method in this range, 
which was applied only for estimating the Tj y shows that 
these estimates can be more strongly affected by errors in 
the noise estimates. 


Figure 2 shows the maximum-likelihood curve for var- 
ious values of M . These curves have the same general 
appearance as the corresponding curves for the method 
given in [1]. Again, on the right side of the figure, where 
the curves for various M are widely separated, the errors 
shown are higher than those shown in [1]. The significant 
points on these curves, with M > L } show an improvement 
over [1]. 
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VI. Joint Maximum-Likelihood Estimates for 
the Equal Noise Case 

An important special case which is seen in practice has 
all the <Tj equal. Here maximum-likelihood estimates can 
be used to simultaneously determine the noise and signal 
levels (the problems cited above no longer apply). 

Denote the common value of the Oj by a. The formula 
in Eq. (12) for A becomes 

A = —K In (27 ra 2 ) - In ( 7 ) 


where w is again an undetermined real quantity which will 
be set equal to 0. The noise error 6a can be found from 
Eq. (34). Eliminating STj by the use of Eq. (36), the result 
is 


6a 1 

~7 = 2 (I< - 1) 


x 


^ 2 ' 2 



E 


6 ^ km 

2 a 2 


T m Tk 


(37) 


K 


- E 


®kj 

2a 2 



To get variance estimates, square the expressions in 
(33) 

Eqs. (36) and (37) and take the expected value. Denoting 
the deviation in 7} by the prefix <5 4 , we get 


To find the noise and signal amplitudes simultaneously 
by the maximum-likelihood method, set the derivative of 
A with respect to a equal to zero, and solve this equation 
together with Eq. (13). Simplifying the derivative by using 
Eq. (13), the equation obtained is 


E(\hTj\ 2 ) = 


(7 - 1) : 


{z 7 


\fi\ 2 + 


IK 

4jf^T r|r ' 1 



K 

2a 2 {K- I + 7 ) = E 5ti 

k~\ 


(34) 


6a 2 \ 1 

a 2 ) ~ 4 L(I< - 1 ) 


(39) 


Eliminating a from Eq. (13), we get 


Summing Eq. (38), 


K 


K 


E 5 ‘> T * - K- 1 + 7 \£[ H Tj 


(35) 


K 




I< 


1 + T 


K 


4 A' - 1 


(40) 


As before, this equation is solved by taking the vector with 

components Tj to be the eigenvector of the matrix (a k j) 
corresponding to the largest eigenvalue. This eigenvalue 

must be the coefficient of Tj on the right, which determines 

7 , and hence determines the Tj up to a common complex 
factor of absolute value 1. The noise estimate is then given 
by Eq. (34). 

To get variance; estimates, proceed as before. The equa- 
tion analogous to Eq. (22) is 


K 




Jfc=l 


2a 2 


K -1-7 f 
2(K - 1)( 7 - 1) ; 


E fia/cm ™ rp 

~2^ TmTk 


7 Tj 


K 


2(A - 1 )fr[ 2a 2 


E B-W (36) 


Using the above formulas with Eq. (30), 


E(\6 4 wj\ 2 ) 


1 

2 <r 2 ( T - l ) 2 


x 


7(7 - 1 - \Tj\ 2 ) + ^7 2 P)l 2 


+ 


(27 -l ) 2 
4(A' - 1) 



(41) 


Vil. Performance of the Weight Estimates 

We now consider the combined signal, formed by taking 
the weighted sum 
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K 

z(i) = y^wtrt(t) (42) 

k = 1 

where the weights are based on data before time i. If we 
use w k = w k + 6w k and express r k (i) by Eq. (1), we get 
four terms: 


(2) Method 2: the method based on Eqs. (26) and (28). 

(3) Method 3: the modification of method 2 described 
in Section V. 

In method 3, T\ is estimated from Eq. (26) based on N 
samples, and the other Tj are found from Eq. (28) using a 
later set of L samples. This is the method treated in [1]. 


z(i) 


K K 

E w k S k a(i) + E 


fc=i 


k = \ 


K K 

+ E ^a(i)Sw k -|- E 

Jfc=l fc=i 


K 

= E «*&2(i) + + £ 2 + £3 (43) 

fc=l 


The first term is the desired signal. The other terms are 
contributions to the error which can be considered sepa- 
rately, since their cross products have zero expectation. 

Averaging over values of the current signal first, we have 


K 


£(|£i| 1 2 ) = £ 2 <r*W=7-l (44) 


*=i 


E(\E 2 \ 2 )= -e 


K 


E 2(rkTkbw k 


k~\ 


(45) 


The quantity E(\E 2 \ 2 + \E^\ 2 )/\ w k S k \ 2 , the relative 
mean-square error caused by weight errors, is plotted for 
these three methods in Fig. 3. The formulas used are given 
in Appendix B. The values are plotted as a function of 
the signal- to-noise ratio, with the parameters L, M t and 
N fixed. (Throughout the curves, we take the SNR in 
every other channel to be one- tenth as large as the SNR 
in channel 1.) It is seen that method 2 for N = L is 
better than method 3, although the difference is small at 
a low SNR. (For N » L , method 3 would be better 
than method 2.) Of course, the ML value (method 1) is 
smallest, but the difference from method 2 is small at high 
SNR. 

If we assume all channels have the same noise level, then 
method 4, the maximum-likelihood method of Section VI 
can also be considered. Comparing method 4 with the 
first three methods, it is seen that those methods can be 
improved by using a unified single noise estimate from the 
off-signal data. These improved methods are denoted by 
1', 2', and 3'. The relative error for these four methods 
is shown in Fig. 4 (formulas are given in Appendix B). 
The behavior shown in Fig. 3 occurs here again, with the 
curves at a lower level, and method 4 is slightly better 
than method l f . 


E(\E 3 \ 2 )= E^2a 2 \6w h \ 2 j (46) 

These quantities are easily evaluated by using previous 
formulas. 

The following estimation methods are referred to in 
Figs. 3 and 4: 

(1) Method 1: the maximum-likelihood method of Sec- 

tion I. 


VIII. Conclusions 

Maximum-likelihood methods for combining weight es- 
timation provide a consistent decrease in the mean-square 
error of the combined signal, as compared with other esti- 
mation methods, at the cost of a small increase in compu- 
tational complexity. The part of the error which is caused 
by weight errors is decreased by over 2 dB, provided that 
at least as many samples are used to estimate the noise 
variance as the T k . This can reduce the number of sam- 
ples needed for equivalent performance to 30 percent less 
than the number needed by method 3. 
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RELATIVE MEAN-SQUARE ERROR RELATIVE MEAN-SQUARE ERROR 



NUMBER OF SAMPLES, L 

Fig. 1. Relative errors of the weight estimates. 



SNR IN FIRST CHANNEL 

Fig. 3. Relative errors in the combined signal caused by weight 
errors (independent noise variances). 
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Fig. 4. Relative errors in the combined signal caused by weight 
errors (equal noise variances). 






Appendix A 

Method for Solving Eq. (13) 


The mathematical problem posed by Eq. (13) is the 
following: Given a K x K Hermitian matrix A, find its 
maximum eigenvalue and the corresponding eigenvector. 

If the maximum eigenvalue is also the eigenvalue of 
maximum absolute value, then this can be accomplished 
by an iterative procedure. Choose any convenient starting 
vector xo- 

If we take x n = Ax n _i for n > 1, then x„, when 
normalized, approaches the eigenvector which is sought. 
Choose positive numbers c n so that 


Ax n _! 



ha s unit length. Then x n approaches the eigenvector and 
c n approaches the eigenvalue. 

The rate of convergence of this procedure depends on 
the size of the next largest eigenvalue, as compared with 
the first. In our application, the convergence is slow for low 
SNR. However, the method can be modified to speed up 
the convergence. When suitably modified, the difference 
between x n and the eigenvector decreases by a factor of 
the order of (7 — l)/y/Z at each step. 


107 


Appendix B 

Formulas Used for Figs. 3 and 4 


The formulas used for Figs. 3 and 4 are presented here. 
For method 1, 


E(\E 2 \ 2 + \E 3 \ 2 ) 


t(?T 2 + K ~ *) , 7 

1(7 - 1) M ( 7 - 1) 


1 — 87 + 47 2 

4(7 - l) 2 


K 


Ei^r+7 


(B-l) 


For method 2, 


E(\E 2 \ 2 + \E 3 \ 2 )=^ 


^T|r(l + |T ' 1 | 2 ) 2 + (/F-l) 


1 + Itxl 2 

mi 2 



7(7 - 1 ) 
4|Ti| 4 



K 

+ 2 + 2 | f 1 | 2 + £| T *| 4 

k-\ 


(B-2) 


For method 3, 


E(\E 2 \ 2 + \E 3 \ 2 ) = ±7(7-1) 


0 + ml 2 ) 2 

N |Ti| 4 



K - 2 + 7 2 

mi 2 


+ K - 7 - 


1 



7(7 ~ 1) 

4|Ti | 4 



K 

+ 2 + 2|f 1 | 2 + ^|T,| 4 

k=l 


(B-3) 


For method 4, 


E(\E 2 \ 2 + \e 3 \ 2 ) = 


7 

L ( 7 - 1) 



+ A' - 1 + 


(7-j) 2 ! 


(B-4) 


For method 1', 


E(\E 2 \ 2 + |£ 3 | 2 ) = 


7(j7 2 + 1) 

£(7-1) 1 


7(7 -I ) 2 
MK(j - 1) 


(B-5) 


For method 2', 


E(\E 2 \ 2 + |£ 3 | 2 ) = | 


7(7 - 1 ) 
4|Ti| 4 


(l + |Ti| 2 ) 2 + (A'-l) 


l + mi 2 

mi 2 


1 

+ MK 


' 7(7 ~ 1) _ 7 2 ~ 1 

. 4|f!| 4 |TiP 


+ 7 2 + 7 + 1 


(B-6) 
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For method 3', 


E(\E 2 \ 2 + \E 3 \ 2 ) = p(y- 


(1 + mi 2 ) 2 1 

iV|fi | 4 i 


' A '-2 + 7 2 

. mi 2 


+ K - 7 - 1 


1 

+ MK 


t(t~ 1) 

. 4|Ti| 4 



+ 7 2 + 7 + 1 


(B-7) 
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